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Abstract 

In this paper a class of new goodness-of-fit tests based on a charac¬ 
terization is presented. The asymptotic properties of test statistics are 
examined. The tests are compared with some standard tests as well as 
some tests based on characterizations. For each test the Bahadur effi¬ 
ciencies against some common close alternatives are calculated. A class of 
locally optimal alternatives is found for each test. The powers of proposed 
tests for small sample size are obtained. 
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1 Introduction 

The uniform distribution is one of the most used distribution in statistical mod¬ 
eling and computer science. Therefore ensuring that the data come from uniform 
distribution is of huge importance. Moreover, testing that the data come from a 
particular distribution can be easily reduced to testing uniformity. More about 
such goodness-of-fit techniques can be found in [^. 

In recent times, the tests based on some characteristic property that distri¬ 
bution posesses have become very popular. Many different types of characteriza¬ 
tions can be found e.g in [^. Some characterizations of the uniform distribution 
can be found e.g in n, m, ED. The first test for uniformity based on char¬ 
acterization that involves moments was proposed by Hashimoto and Shirahata 
in m- Their test was based on Papathanasiou’s characterization presented in 
m- Other tests based on characterizations via moments have bee considered 
in e.g.[l6], [T7] . 

One way to compare tests is to calculate their asymptotic efficiencies. In 
case of non-normal limiting distribution the Bahadur approach to efficiency is 
suitable. It has been considered in e.g m, m, ED, m- 

In this paper we propose new uniformity tests based on characterization that 
involves the moments of order statistics. We examine the asymptotic proper¬ 
ties of the test statistics. This characterization has already been used in m. 
however the nature of test is completely different. 

The paper is organized as follows. In Section [5] we present the characteri¬ 
zation and one class of test statistics based on it. Next, we give the basic of 
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Bahadur theory. In the Section [3] we state some theorems that will be used for 
calculations. Using Bahadur efficiency we compare our test with Hashimoto- 
Shirahata test, Fortiana-Grane test based on maximum correlations (see 0) , as 
well as some standard goodness-of-fit tests. In Section 0] for each proposed test 
we found some classes of locally optimal alternatives. Finally, in Section [5] we 
perform power study and present an application in time series analysis. 


2 Characterization and Test Statistics 


The following characterization by Too and Lin can be found in m- 
Theorem 2.1 Let < oo for some pair {k,n). Then the equality 

2k k(k + 1) 

E^k,n - --—rEXk+i,n+i + -f — .. w——^ = 0 

’ n + 1 [n+ l)(n + 2) 

holds if and only if F(x) = x on (0,1). 


( 1 ) 


Let Xi,X 2 ,. ■. ,Xn be i.i.d. observations having a continuous distribution 
function F. Without loss of generality we may assume that the data belong 
to [0,1] and test for uniform distribution. If we want to test that sample 
has the distribution function Fq we may apply our test to transformed sam¬ 
ple Uq (Xi), Fo (X 2 ),..., Fq (X„). 

Denote X(^a),Xi,...,x„ the a th order statistic of the sample Xi,...,Xn. In 
order to test Hg that F(x) = x, x € (0,1) we propose the following class of test 
statistics. 


n N, (^(l),Xi^,...,X, 

m+lJ ii<...<z„+i 


(n + l)(n + 2) 


n + 1 


-X, 




These statistics are asymptotically equivalent to U- statistics with symmetric 
kernels 


^miXi, Xm+l) — ^2 ^( 1 ), 


x^, 




7r{m) 


{n + l){n + 2) ’ 


where 7r(m) is the set of all permutations of numbers I, 2,..., m. 

The first projections of kernels ..,Xm+i) on X^+i under Fig are 


7Tl — 1 

4>{s) = E{^m{Xi, .., Xm+l)\Xm+l = s) = — — —F(X(i) ,... ,X™_i ) 

-I- E{X(^i~^ Xi,...,x^) - -^E{Xi^2),s,Xi,...,x^) + 7— .. w—— 
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It can be easily shown that they are identically equal to zero under null hypoth¬ 
esis. 

The second projections of ■■,Xm+i) on {Xm, Xm+i) under null hy¬ 

pothesis are 


= - -^E{X(^i^ g t Xi,...,X^-3) H- -^E{X(^i) g Xi,...,X^-2) 

m + 1 \ J, , , • m ^ m + 1 \ J, , , m 2 

1 2 
2 

(n -I- l)(n -I- 2) 

After some calculations we obtain 

=- t ( - 2 + 2(1 - t)- + ( - (1 - 

to(1-I-to)^(2-I-to) V ' 

+ (1 - trt) +to( - 2(1 - s)’"+i + (1 - tr+ 2(1 - 
+ —77^ - 7l{s < t}((l - tr - (1 - s)"*). 

Obviously it is not equal to zero. Therefore we conclude that the kernels of our 
test statistics are weakly degenerate. Hence, we have (see M) 


ut: 


oo 

E 


- 1 ), 


where Vi are the eigenvalues of the integral operator 5 defined by 

1 

Sf{t) = J (t)l,{s,t)f{s)ds. 

0 

Thus we have to solve the following integral equation 

1 

vf{t) = J f(j)*^{s,t)f{s)ds, 

0 


( 2 ) 


1 1 

with constraints / f{s)ds — 0 and / = 1. 

0 0 

Denote y{t) = f{s)ds. After differentiation the expression Q becomes 

1 

vy"it) = -2^ —E— y{t), 1 /( 0 ) =0, 2 /( 1 ) =0, f y'{t)‘^dt = l. 

m+1 J 

0 

This boundary problem can be solved numerically for particular to. 
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3 Local Bahadur Efficiency 

We choose Bahadur asymptotic efficiency as a measure of the quality of tests. 
One of the reasons is that the asymptotic distributions of our test statistics 
are not normal. The Bahadur efficiency can be expressed as the ratio of the 
Bahadur exact slope, a function describing the rate of exponential decrease for 
the attained level of significance under the alternative, and the double Kullback- 
Leibler distance between the null and the alternative distribution. We present 
brief review of the theory, for more about this topic we refer to m, [Ill- 

According to Bahadur’s theory the exact slopes are defined in the following 
way. Suppose that under alternative 

Tn ^ b{e). 

Also suppose that the large deviation limit 

lim n~^ ItiPho >t) = -f{t) (3) 

n—>-oo 

exists for any t in an open interval /, on which / is continuous and {b{0), 9 > 
0} C I. Then the Bahadur exact slope is defined as 

criff) = 2fm)- (4) 


The Raghavachari’s inequality 


ct{9) < 2K{9), 0 > 0, 


(5) 


where K{9) is the Kullback-Leibler distance between the alternative Hi and the 
null hypothesis Hq , leads to natural definition of Bahadur efficiency. Since it is 
very important for a test to distinguish close alternatives from null distribution, 
we define local Bahadur efficiency as 


e®(T) = lim 


ct{9) 


e-.o 2K{0)' 


( 6 ) 


Suppose that the alternative density functions satisfy the regularity conditions 
from ([IH], Chapter 6). Denote h{x) = ^^^^\e=o- Then it can be shown that 
the double Kullbac-Leibler distance between the null distribution and the close 
alternative can be expressed as 


2K{9)=I{g)0^+o{0^), 9^0, 


where I{g) £ (0, oo) is the Fisher information function 


(7) 



g{x,0) 


dx. 


In order to find Bahadur exact slopes we use the following theorems from 
[23] and [22] for U— and V— statistics with weakly degenerate kernels. The first 
theorem gives large deviation function / and the second the limit in probability 
b{0) that appear in (|1|). 
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Theorem 3.1 Let the kernel of U— statistic be bounded and weakly degen¬ 
erate. Let Aq be the smallest A satisfying the integral equation 

1 

J ^*{si,S2)x{s2)ds2, ( 8 ) 

0 

and suppose that Aq is a simple characteristic number of the linear integral 
operator with kernel $* acting from L^[0, 1] into T^[0,1]. Then, for sufficiently 
small a > 0, we have 


limn ^ logP{T„ > a} = , 

i=2 

where the right-hand side is a convergent series with respect to a. Moreover 

Ao 

*^2 =-7- T\- 

m(m — 1) 

Theorem 3.2 Suppose that the family of alternatives Q satisfy some regularity 
conditions from 12S\/ . Than 

^ rn(7n— J o)ge{a:i, 0 )dxida ;2 • + o(6»^), 6^0. (9) 

W- 


We now calculate the local Bahadur efficiencies for test statistics and T^. 
The tests with greater values of m have little or no practical value. 

The second projections of their kernels under Hq are respectively 


g2 g2 2 ^ 

</>t(si,S2) = y + Y “ max(si,S2) + g, 


2 ( 51 : 'S 2 ) = -^/{s < t}(s^ -2t-\-t^ - s^) + i/{s > t}(t^ - 2s + - t^^ + 


( 10 ) 


1 o 1 2 o 2 o 

775 + 77t - TTS - Trt . 

3 3 9 9 


( 11 ) 

Putting \ = V in case of m, the expression (ED becomes well-known 
Sturm-Liouville equation 


y"(t) -\- Xy(t) = 0, y{0) = 0, y{l) = 0. 

We obtain the eigenvalues Xk = fc^Tr^, k = 1, 2... Therefore the main charac¬ 
teristic value is Ai = It is interesting to note that Cramer-von Mises statistic 
has the same kernel. Therefore they have similar asymptotic properties. 
Putting A = in case of (ED, the expression ED becomes 

y"{i) + ^(1 “ 2/(0) = 0, y(l) = 0. 

Using numerical methods we obtain that the main characteristic value is 
Ai = 28.4344. 
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3.1 Competitor tests 

We compare our tests with the following tests 

• Kolmogorov-Smirnov test with test statistic 

£>„ = sup \Fn{t)-t\; (12) 

te(o,i) 


Anderson-Darling test with test statistic 


a 2 _ f ty 


0 




dt: 


• Cramer-von Mises test with test statistic 
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• the test based on maximum correlations [?1 with test statistic 


= |Q„ - 1| = I— ^(2z - n - - 1|; 

1 — 1 

Hashimoto-Shirahata test ([12) with test statistic 


= 


^ h{x„Xj,Xk,Xiy 


i<j<k<l 


(13) 


(14) 


(15) 


(16) 


where 

h{x,,x,,Xk, Xi) = ^ ((w - x,y + (w - Xkf + (x, - xiy 
+ {X, - x^y + {Xj - xiy + {Xu - Xif^ 

- i ^(max(W, Xj) - max{Xk,Xi)){mm{Xj,Xj) - min(X/c, Xi) 
+ {max{Xi,Xk) - max{Xj,Xi)){mm{Xi,Xk) - min(Xj, W) 

-I- {max{Xi,Xi) - max{Xj, Xk)){min{Xi, Xi) - min(Xj, Xfc))^ 


We calculate local Bahadur efficiencies of proposed tests and the competitor 
ones against the following alternatives: 

• a power function distribution with density function 

gi{x,9) = {0 + l)xy XG{0,1), 9>0; (17) 


• a distribution with density 

g 2 {x, 9) = 1 + 9{2x -l)x€ (0,1), 9 > 0; (18) 
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• a mixture of uniform and power function distributions 

g3(x,0) = 1 - 0 +x e (0,1); (19) 

• a second Ley-Paindaveine alternative with density function 

g4{x,9) = 1 — 07rsin7rx, x G (0,1) 0 G [0,7r“^]. (20) 

It can be shown that those alternatives satisfy regularity conditions from [22] . 

In order to calculate local Bahadur efficiencies we use the results for exact 
Bahadur slopes from [TH] and [5] for (IT^ - (fT5l) . The large deviations functions 
are respectively 

/z)(u) = —2n o(u ), a —y 0, 

/^(a) = -a + o(a^), a -)■ 0, 

5 

/qW = “ 2“^ + a 0. 


Since for Hashimoto-Shirahata test the exact Bahadur slope is not derived 
yet, we do it here. 

Notice that Cn is an U— statistic with kernel h{Xi, X 2 , X 3 , X 4 ). It is easy 
to show that the kernel is weakly degenerate. The projection of the kernel on 
Xi and X 2 is equal to 

h*is4,S2) = E{h{Xi,X2,X3,X4)\X4 = Si, X 2 = S 2 ) = 

6 

min(si,S 2 ) 2 siS 2 s\s\ 

^ 18 9 V 

According to ^ we find that the smallest A satisfies the integral equation 

1 1 

^( 51 ) = A J h*{si,S2)x{s2)dS2, j x{S2)dS2=^, 

0 0 

or equivalently 

1 

x'\si) = ^(0) = ^(1) ^ y* x{s 2 )ds 2 = 0. 

0 

This differential equation has a solution if and only if A is the solution of the 
following equation 

sin (\/Acos _ 6-\/2sin = 0. 

6^2 V 6^2 6^2/ 

The smallest positive solution is Aq = 727r^. 

In Table [T] we present the local Bahadur efficiencies of considered tests for 
alternatives (HZ])-®- The bolded numbers represent the cases where our tests 
outperformed, in Bahadur sense, the competitor ones. 
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Table 1: Local Bahadur efficiency 


Alternative 

A 

D 

7’(2) 


C 

'p(i) 

5i 

0.40 

0.54 

0.81 

0.14 

0.37 

0.73 

52 

0.5 

0.75 

0.95 

0 

0.66 

0.98 

53 ( 3 ) 

0.48 

0.74 

0.82 

0.06 

0.63 

0.94 

54 

0.49 

0.81 

0.96 

0 

0.76 

1 


Table 2: Local Bahadur efficiency for location alternatives 


Statistics Gaussian Cauchy 


^4^ 

0.96 

0.66 

Dn 

0.64 

0.81 

Qn 

0 

0 

Cn 

0.49 

1 

7 ^( 1 ) 

^ n 

0.955 

0.76 

rp{2) 

n 

0.87 

0.72 


In Table [5] we present the local Bahadur efficiency of our tests applied to 
normal and Cauchy null distribution and corresponding location alternatives. 
We use notation from m, Chapter 2). 

It is interesting to note that Hashimoto-Shirahata is locally optimal for lo¬ 
cation alternative of Cauchy distribution. 


4 Local optimality 

In this section we present a class of locally optimal alternatives for each test. 
Following theorem follows from Theorem 5 from [22] . 

Theorem 4.1 Let g{x; 9) be the density from Q. Then, for small 9, alternative 
densities 

g{x- 9) = 1 + 9{Cfo^m{x)), a; G [0,1], C > 0, 

where fo^m{x) is the eigenfunction that corresponds to first eigenvalue of the 
integral operator is a class of locally optimal alternatives for test based on 

'Y'm 
^ n ■ 

Applying this theorem we obtain that for some of the locally optimal den¬ 
sities are 


= I-b 0cos7rx, a;G[0,l], 
, m 3(2-0)0 1 

5(1),2(2:, 0) = 


2arctanJ^l-^cos7ra; 


,x G [0,1]. 


( 21 ) 

( 22 ) 


Since the locally optimal densities for are to complicated to display we 
present them in FigureElI]and[2] 








Figure 1: Locally optimal alternatives g(2),i{x,0) 



Figure 2: Locally optimal alternatives g{ 2 ). 2 {x,d) 


5 Power study and application 

The powers of our and competitors tests are presented in Figures [2E1 The 
powers are calculated using Monte Carlo simulations with 10000 replicates at 
the level of significance 0.1. 

From these figures we can conclude that for all considered values of parameter 
9 our tests outperform the competitor ones. 

Goodness-of-fit tests for uniformity have application in time series analysis. 
We illustrate this with the following example. 

Consider time series generated by process 

t , , 

Xt = sin-TT + zt, t = 1...100. (23) 

o 

We want to test that xt is Gaussian white noise and there are no hidden pe- 
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Figure 3: Powers of test for power alternatives 



Figure 4: Powers of test for g 2 alternatives 


riodicities. 

than 


Using the results from we have that if null hypothesis is true 


Yk 




k = l..,48, 


where coi = 27r^ are Fourier frequencies and I is periodogram, are the ordered 
statistics from uniform 17[0,1] distribution. Therefore we may apply our test. 
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Figure 5: Powers of test for Ley-Paindaveine alternatives 



Figure 6: Powers of test for mixture with power alternatives 


We obtain the p-value less than 0.05 and therefore we may confirm the existence 
of hidden periodicities. 
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6 Conclusion 


In this paper we proposed new goodness-of-fit tests for uniform and other contin¬ 
uous distributions based on moment characterizations. We found their Bahadur 
efficiencies under close alternatives and showed that in almost all considered 
cases our tests were more efficient than common competitor tests. We calcu¬ 
lated the powers of our tests for small sample sizes and noticed that in most 
cases our tests were more powerful. Finally we presented one application of our 
tests in time series analysis. 
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